Basic level scene understanding: categories, attributes and structures
نویسندگان
چکیده
A longstanding goal of computer vision is to build a system that can automatically understand a 3D scene from a single image. This requires extracting semantic concepts and 3D information from 2D images which can depict an enormous variety of environments that comprise our visual world. This paper summarizes our recent efforts toward these goals. First, we describe the richly annotated SUN database which is a collection of annotated images spanning 908 different scene categories with object, attribute, and geometric labels for many scenes. This database allows us to systematically study the space of scenes and to establish a benchmark for scene and object recognition. We augment the categorical SUN database with 102 scene attributes for every image and explore attribute recognition. Finally, we present an integrated system to extract the 3D structure of the scene and objects depicted in an image.
منابع مشابه
Building a Taxonomy of Attributes for Fine-Grained Scene Understanding
This paper presents the first effort to discover and exploit a diverse taxonomy of scene attributes. Starting with the fine-grained SUN database, we perform crowd-sourced human studies to find over 100 attributes that discriminate between scene categories. We construct an attributelabeled dataset on top of the SUN database [7]. This “SUN Attribute database” spans more than 700 categories and 14...
متن کاملTransient Attributes or High-Level Understanding and Editing of Outdoor Scenes
We live in a dynamic visual world where the appearance of scenes changes dramatically from hour to hour or season to season. In this work we study “transient scene attributes” – high level properties which affect scene appearance, such as “snow”, “autumn”, “dusk”, “fog”. We define 40 transient attributes and use crowdsourcing to annotate thousands of images from 101 webcams. We use this “transi...
متن کاملBridging the Semantic Gap : Image and video Understanding by Exploiting Attributes
Title of dissertation: BRIDGING THE SEMANTIC GAP : IMAGE AND VIDEO UNDERSTANDING BY EXPLOITING ATTRIBUTES Xiaodong Yu, Doctor of Philosophy, 2013 Dissertation directed by: Professor Yiannis Aloimonos Department of Electrical and Computer Engineering Understanding image and video is one of the fundamental problems in the field of computer vision. Traditionally, the research in this area focused ...
متن کاملSceneNet: A Perceptual Ontology for Scene Understanding
Scene recognition systems which attempt to deal with a large number of scene categories currently lack proper knowledge about the perceptual ontology of scene categories and would enjoy significant advantage from a perceptually meaningful scene representation. In this work we perform a large-scale human study to create “SceneNet”, an online ontology database for scene understanding that organiz...
متن کاملConstrained Semi-Supervised Learning Using Attributes and Comparative Attributes
We consider the problem of semi-supervised bootstrap learning for scene categorization. Existing semi-supervised approaches are typically unreliable and face semantic drift because the learning task is under-constrained. This is primarily because they ignore the strong interactions that often exist between scene categories, such as the common attributes shared across categories as well as the a...
متن کامل